Well-Behaved Evaluation Functions for Numerical Attributes

نویسندگان

  • Tapio Elomaa
  • Juho Rousu
چکیده

The class of well-behaved evaluation functions simpliies and makes eecient the handling of numerical attributes; for them it suuces to concentrate on the boundary points in searching for the optimal partition. This holds always for binary partitions and also for multisplits if only the function is cumulative in addition to being well-behaved. A large portion of the most important attribute evaluation functions are well-behaved. This paper surveys the class of well-behaved functions. As a case study, we examine the properties of C4.5's attribute evaluation functions. Our empirical experiments show that a very simple cumulative rectiication to the poor bias of information gain signiicantly outperforms gain ratio.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

On the Well-Behavedness of Important Attribute Evaluation Functions

The class of well-behaved evaluation functions simplifies and makes efficient the handling of numerical attributes; for them it suffices to concentrate on the boundary points in searching for the optimal partition. This holds always for binary partitions and also for multisplits if only the function is cumulative in addition to being well-behaved. The class of well-behaved evaluation functions ...

متن کامل

General and Eecient Multisplitting of Numerical Attributes

Often in supervised learning numerical attributes require special treatment and do not t the learning scheme as well as one could hope. Nevertheless, they are common in practical tasks and, therefore, need to be taken into account. We characterize the well-behavedness of an evaluation function, a property that guarantees the optimal multi-partition of an arbitrary numerical domain to be deened ...

متن کامل

Exact solutions to the focusing nonlinear Schrödinger equation

A method is presented to construct certain explicit solutions to the focusing cubic nonlinear Schrödinger equation on the line. Such solutions involve algebraic combinations of polynomials, trigonometric functions, and exponential functions of x and t. In a particular case, the analytic extensions of such solutions to the entire xt-plane yield soliton solutions where the number of solitons, the...

متن کامل

Anonymization of nominal data based on semantic marginality

Nominal attributes are very common in data sets about individuals, specifically medical data like patient healthcare records. Attributes of this type tend to be sensitive due to their personal nature. If public-use data sets need to be released, e.g. for clinical research purposes, data should be first anonymized. However, since most anonymization methods omit data semantics when dealing with n...

متن کامل

Incomplete Ordinal Information in Value Tree Analysis

In value tree analysis, additive value functions are used to model alternatives' overall values. Diculties in complete evaluation of alternatives' attribute-specic values (scores) and attributes' relative importance have spurred research on the modeling of incomplete preference information. This thesis shows how incomplete ordinal information, captured as ordinal information where a set of rank...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997